Abstract. The first measurement of the top quark pair production cross section in the fully hadronic channel at
a center-of-mass energy of $\sqrt{s} = 7$ TeV is presented. The measurement has been performed using an integrated
luminosity of 1.09 fb$^{-1}$, collected with the CMS detector. The reconstruction of the $t\bar{t}$ candidates is performed
after a cut-based event selection using a kinematic fit. A data-driven technique is used to estimate the dominant
background from QCD multijet production. The cross section is determined from a fit to the top quark mass. The
cross section measurement yields $\sigma(t\bar{t}) = 136 \pm 20\,(\mathrm{stat.}) \pm 40\,(\mathrm{sys.}) \pm 8\,(\mathrm{lumi.})$ pb. This result is consistent with
an independent measurement in the same channel, with the measurements in other decay channels and with the
Standard Model predictions.



1 Introduction 

Measurements of the top quark pair production cross section
in proton-proton collisions at the LHC provide important
tests of our understanding of the top quark production
mechanism and can also be used in new physics searches.
Recent CMS [1] measurements, making use of the full 2011
dataset corresponding to an integrated luminosity greater
than 1 fb$^{-1}$, have been performed in the dilepton channel,
including $\tau$ leptons, and in the lepton+jets channel [2,3,4].

Here the first cross section measurement in the fully
hadronic decay channel with the CMS detector is presented
[5]. The fully hadronic decay channel has the largest signal
yield compared to the dilepton and lepton+jets channels.
However, with only jets in the final state, this channel
is dominated by the nearly identical QCD multijet back- 
ground. This measurement demonstrates the good under- 
standing of the CMS detector and is, therefore, an impor- 
tant milestone. This measurement was combined with the 
more recent measurements in the other channels, providing 
the most precise CMS measurement obtained to date. 



2 Event Selection 

To select fully hadronically decaying top quark pairs, the
candidate events are required to have at least four jets with
$p_T > 60$ GeV/c, a fifth jet with $p_T > 50$ GeV/c, and a sixth
jet with $p_T > 40$ GeV/c. All jets are required to be within
an absolute pseudorapidity $|\eta|$ of 2.4, in the tracker
acceptance. Additional jets are considered only if they have a
transverse momentum of more than 30 GeV/c.
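
The jet requirements above can be written as a short selection function. This is only a minimal sketch: the `Jet` class and `select_jets` are hypothetical names introduced for illustration, and the thresholds are applied to the jets ordered by decreasing $p_T$, which is assumed to be equivalent to the wording in the text.

```python
from dataclasses import dataclass

@dataclass
class Jet:
    pt: float   # transverse momentum in GeV/c
    eta: float  # pseudorapidity

# Thresholds quoted in the text for the six leading jets (GeV/c).
PT_THRESHOLDS = (60.0, 60.0, 60.0, 60.0, 50.0, 40.0)
ETA_MAX = 2.4         # tracker acceptance
EXTRA_JET_PT = 30.0   # threshold for additional jets (GeV/c)

def select_jets(jets):
    """Return the selected jets ordered by decreasing pT, or None if the event fails."""
    central = sorted((j for j in jets if abs(j.eta) < ETA_MAX),
                     key=lambda j: j.pt, reverse=True)
    if len(central) < len(PT_THRESHOLDS):
        return None
    # The i-th leading jet must exceed the i-th threshold.
    for jet, threshold in zip(central, PT_THRESHOLDS):
        if jet.pt <= threshold:
            return None
    # Jets beyond the sixth are kept only above the additional-jet threshold.
    extra = [j for j in central[6:] if j.pt > EXTRA_JET_PT]
    return central[:6] + extra
```

An event that fails any of the six thresholds returns `None`; otherwise the returned list holds the six leading jets plus any additional jets above 30 GeV/c.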

Tagging jets originating from bottom quarks (b-tagging)
is achieved with a robust b-tagging algorithm based on the
reconstruction of secondary vertices [6]. This algorithm
yields an efficiency of 38 ± 4% while having a mis-tag rate
of 0.12 ± 0.02%. At least two b-tagged jets are required in
each event.



2.1 Kinematical Fit 

For the final selection of candidate top quark events a kine- 
matic (least-squares) fit is applied. It exploits the charac- 
teristic topology of top quark events: two W bosons with 
a mass of 80.4 GeV/$c^2$ that can be reconstructed from the
untagged jets and two top quarks that can be reconstructed 
from the W bosons and the b-tagged jets. The masses of 
the two top quarks are assumed to be equal, but not fixed, 
in order to avoid any bias. 

To find the right combination of jets, the fit procedure
is repeated for every distinguishable jet permutation. This
is done using all (six or more) jets that passed the selection
criteria. All b-tagged jets are taken as bottom quark
candidates; the remaining jets are taken as light quark
candidates. If the fit converges for more than one of the
possible jet permutations, the one with the minimum $\chi^2$ is
chosen. After the kinematic fit, all events with a fit probability
$P(\chi^2) > 0.01$ are accepted. In the kinematic fit Gaussian
resolutions are used for the jets.
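
The permutation loop described above can be sketched as follows. The function names are hypothetical, jets are represented by their indices, and `fit` is a placeholder callback standing in for the actual kinematic fit, which is not reproduced here. The $\chi^2$ probability is computed for three degrees of freedom; this number is an assumption (two W mass constraints plus the equal-mass constraint) and is not stated in the text.

```python
import math
from itertools import combinations

def jet_permutations(b_jets, light_jets):
    """Yield distinguishable hypotheses ((b, (q, q')), (b', (q'', q'''))).

    The two top candidates are interchangeable, so each hypothesis is
    produced exactly once.
    """
    for b1, b2 in combinations(b_jets, 2):
        for four in combinations(light_jets, 4):
            first = four[0]
            # Fixing the first light jet in one W counts each split into two pairs once.
            for partner in four[1:]:
                w1 = (first, partner)
                w2 = tuple(q for q in four if q not in w1)
                yield ((b1, w1), (b2, w2))
                yield ((b2, w1), (b1, w2))

def chi2_probability_3dof(chi2):
    """Upper-tail chi-square probability for 3 degrees of freedom (ASSUMED ndf)."""
    return (math.erfc(math.sqrt(chi2 / 2.0))
            + math.sqrt(2.0 * chi2 / math.pi) * math.exp(-chi2 / 2.0))

def best_hypothesis(b_jets, light_jets, fit, min_prob=0.01):
    """Pick the converged permutation with minimum chi2, then apply the probability cut.

    `fit(hypothesis)` is a hypothetical callback returning chi2, or None if
    the fit did not converge.
    """
    results = [(fit(h), h) for h in jet_permutations(b_jets, light_jets)]
    converged = [(c, h) for c, h in results if c is not None]
    if not converged:
        return None
    chi2, hypothesis = min(converged, key=lambda r: r[0])
    return (chi2, hypothesis) if chi2_probability_3dof(chi2) > min_prob else None
```

For an event with exactly two b-tagged and four untagged jets this enumeration yields six hypotheses: three ways to pair the light jets into W candidates, times two ways to attach the b jets.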

The number of events in data passing each selection
step and the expected signal fraction using a Standard Model
$t\bar{t}$ production cross section of 163 pb [7] are given in
Table 1.



3 Signal Extraction 

The number of signal events after the final selection is de- 
termined via an unbinned maximum likelihood fit to the 
reconstructed top quark mass distribution, obtained from 
the kinematic fit. The shapes used in the fit for the signal 
and background distributions are derived from simulation 
and a data-driven estimate, respectively. The resulting
distribution is shown in Figure 1.
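
As an illustration of how a signal fraction can be extracted from an unbinned likelihood with fixed shapes, the sketch below scans the fraction over a grid. It is not the fit used in the analysis: the function names are hypothetical, the grid scan stands in for a proper minimizer, and the signal and background shapes are supplied by the caller.

```python
import math

def neg_log_likelihood(f_sig, masses, signal_pdf, background_pdf):
    """Unbinned negative log-likelihood of a two-component mixture."""
    return -sum(
        math.log(f_sig * signal_pdf(m) + (1.0 - f_sig) * background_pdf(m))
        for m in masses
    )

def fit_signal_fraction(masses, signal_pdf, background_pdf, steps=200):
    """Return the grid value of f_sig in (0, 1) that minimizes the NLL."""
    grid = [i / steps for i in range(1, steps)]
    return min(
        grid,
        key=lambda f: neg_log_likelihood(f, masses, signal_pdf, background_pdf),
    )
```

In practice the statistical uncertainty on the fraction would be taken from the curvature of the likelihood around its minimum, which this sketch does not compute.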

Table 1. Number of events and the expected signal fraction in the
data sample after each selection step. The expected signal fraction
is taken from the simulation, assuming a cross section of 163 pb.

Selection Step       Events    Signal Fraction
At least 6 jets      248109    2%
At least 2 b-tags    6905      17%
Kinematical Fit      1620      32%

Fig. 1. Result of the fit of the reconstructed top quark mass for the
$t\bar{t}$ simulation (solid line) and the multi-jet QCD estimated from
data (dashed line). The uncertainty stated on the signal fraction
$f_{sig}$ is only statistical.



4 QCD Background Estimation 

The background from QCD multijet events is estimated
from events with six or more jets, none of which is
b-tagged. In this region of the phase space the signal
contribution is below 1%. As the kinematics of b-tagged
jets and non-b-tagged jets differ, the events that do
not contain any b-tag are weighted such that they reproduce
the kinematics of the b-tagged jets. This is done by
deriving a ratio:



$$R(p_T, |\eta|) = \frac{N(p_T, |\eta|, b)}{N(p_T, |\eta|, \text{not } b)} \qquad (1)$$



where $N(p_T, |\eta|, b)$ is the number of b-tagged jets in events
with at least one jet coming from b quarks and
$N(p_T, |\eta|, \text{not } b)$ is the number of untagged jets from all
events in data. The ratio $R$ gives the probability of b-tagging
a jet as a function of its transverse momentum $p_T$ and
absolute pseudorapidity $|\eta|$. The kinematic fit is performed
on all events with zero b-tagged jets, treating every jet as a
bottom quark candidate. For all permutations with $P(\chi^2) \geq$
0.01 an event weight



$$w = R(p_T^{b}, |\eta|^{b}) \times R(p_T^{\bar{b}}, |\eta|^{\bar{b}}) \qquad (2)$$



is calculated depending on the $p_T$ and $|\eta|$ of the two jets
assigned to the bottom quarks in the kinematic fit hypothesis.
The weighted events are used for the estimation of the
QCD multijet background that passes the event selection
criteria with two b-tagged jets.
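
The weighting of Eqs. (1) and (2) can be illustrated with a binned lookup. This is a sketch under stated assumptions: the bin edges `PT_EDGES` and `ETA_EDGES` are hypothetical (the binning used in the analysis is not given in the text), and the function names are introduced only for this example.

```python
from bisect import bisect_right

# HYPOTHETICAL bin edges; the binning of the analysis is not stated in the text.
PT_EDGES = [30.0, 50.0, 80.0, 120.0, 200.0]   # GeV/c
ETA_EDGES = [0.0, 1.2, 2.4]                   # |eta|

def bin_index(edges, value):
    """Index of the bin containing value, clamped to the first and last bins."""
    i = bisect_right(edges, value) - 1
    return min(max(i, 0), len(edges) - 2)

def tag_ratio(n_tagged, n_untagged):
    """Eq. (1): per-bin ratio of tagged to untagged jet counts."""
    return [
        [t / u if u > 0 else 0.0 for t, u in zip(tagged_row, untagged_row)]
        for tagged_row, untagged_row in zip(n_tagged, n_untagged)
    ]

def event_weight(ratio, b_jet, bbar_jet):
    """Eq. (2): product of R for the two jets assigned to the bottom quarks.

    Each jet is a (pt, eta) tuple.
    """
    weight = 1.0
    for pt, eta in (b_jet, bbar_jet):
        weight *= ratio[bin_index(PT_EDGES, pt)][bin_index(ETA_EDGES, abs(eta))]
    return weight
```

Here `n_tagged` and `n_untagged` are count tables indexed first by $p_T$ bin and then by $|\eta|$ bin; each zero-tag permutation that passes the fit probability cut enters the background template with the resulting weight.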



5 Results 

The inclusive cross section for top quark pair production 
in the fully hadronic decay channel has been derived from 



$$\sigma_{t\bar{t}} = \frac{f_{sig} \cdot N}{\epsilon \cdot L_{int}} \qquad (3)$$



The fraction of signal events, $f_{sig}$, is estimated by the fit of
the top quark mass distribution and is found to be 25.0%.
The number of events $N$ seen in the data after the full
selection is 1620, and the efficiency for selecting $t\bar{t}$ events, $\epsilon$,
which is determined from simulation, is 0.22%. The latter
includes the correction factors for the b-tagging efficiency,
mis-tag rate and trigger efficiency found in data.
The integrated luminosity of the data sample is $L_{int} = 1.09$
fb$^{-1}$. With these numbers the $t\bar{t}$ production cross section
$\sigma_{t\bar{t}} = 136 \pm 20\,(\mathrm{stat.}) \pm 40\,(\mathrm{sys.}) \pm 8\,(\mathrm{lumi.})$ pb is obtained.
The total relative uncertainty corresponds to ±33%.
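
The quoted total relative uncertainty is reproduced if the statistical, systematic and luminosity components are combined in quadrature. That combination rule is an assumption here, since the text does not state it explicitly; the function name is hypothetical.

```python
import math

def total_relative_uncertainty(central, *components):
    """Combine uncorrelated absolute uncertainties in quadrature, relative to central."""
    return math.sqrt(sum(c * c for c in components)) / central

# Values quoted in the text (pb): 136 +- 20 (stat.) +- 40 (sys.) +- 8 (lumi.)
relative = total_relative_uncertainty(136.0, 20.0, 40.0, 8.0)
print(f"{100.0 * relative:.0f}%")  # prints "33%"
```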

To determine the systematic uncertainties on the cross
section measurement, samples of simulated events with
modified parameters have been used. For each variation the
change in the efficiencies and the fitted signal fraction, and
hence the cross section, has been studied using
pseudo-experiments. An overview of the main sources of
systematic uncertainties is given in Table 2.



Table 2. Summary of the main sources of systematic uncertainties
with their values.

Systematic Error     Relative Uncertainty
b-tagging            15.7%
Jet Energy Scale     13.5%
Background           12.2%
$Q^2$ scale          8.7%



6 Cross Check Analysis 

A second measurement has been performed as a cross check. 
For this analysis the kinematic properties of signal and 
background events are characterized to develop a kinematic 
selection based on a neural network. Moreover, the QCD 
multijet background is estimated from data using the in- 
formation from the kinematic variables of dijet pairs that 
passed loose b-tagging criteria and extrapolating to the signal
region with tighter b-tagging criteria and higher jet
multiplicities. 

The neural network has been trained using in total eight
kinematic variables. These variables are sphericity,
centrality, aplanarity, the sum of transverse jet energies (including
and without the first two jets), the transverse jet energies
weighted with $\sin^2\theta$ ($\theta$ being the angle enclosed by the jet
and the beam-axis in the center-of-mass frame of the
multijet final state) for the first and second jets, as well as the
mean of the remaining jets. The events are all required to
have a jet multiplicity of $6 \leq N_{jet} \leq 8$, where the jet
selection criteria are the same as for the main analysis. The
outcome of the training process is shown in Figure 2. The
selected events are required to have a neural network
output $NN_{out} > 0.65$.
Fig. 2. Neural network output for simulated $t\bar{t}$ events and
multijet events from data shown normalized to unity.



To improve the purity of the selected sample two b-tags
are required using an algorithm different from the one used
in the main analysis, with an efficiency of 63 ± 4% and
mis-tag rate of 1.8 ± 0.2%. The assumption for the
data-driven QCD multijet background estimate is that the $b\bar{b}$
dynamics in events with six or more jets can be appropriately
inferred from a control sample of events with four or
five jets. Three variables are used to parametrize the
probability that both jets in a given dijet pair are b-tagged:
the average transverse momentum $\langle p_T \rangle$, the average
pseudorapidity $\langle |\eta| \rangle$, and the opening angle between the
two jets $\Delta R$.

Events with six or more jets have by definition more
dijet combinations than events with four or five jets. In
order to avoid a bias due to this fact the probability that a
given jet pair with both jets having a loose b-tag also has
both jets with a medium b-tag was parametrized. With this
parametrization, referred to as the ratio $R_T$, the expected
amount of background pairs of medium-tagged jets is
obtained by weighting each event with

$$w = \sum_{\text{pairs}} R_T(\langle p_T \rangle, \langle |\eta| \rangle, \Delta R) \qquad (4)$$

where the sum runs over all loose-tagged jet pairs in the
event with at least two loose b-tags. The $t\bar{t}$ signal is
extracted from the reconstructed top quark mass. For the event
reconstruction a kinematic fit is used, similar to the one
used for the main analysis, with the requirements of $\chi^2 <$
40 and a minimum opening angle between the b-tagged
jets of $\Delta R > 1.2$. After the full event selection 937 events
remain from which 1125 top quark masses can be
reconstructed. Three templates are used to describe the
reconstructed top quark mass distribution.
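
The per-event weight of Eq. (4) is a sum over all pairs of loose-tagged jets. In the sketch below the jet dictionaries, the function names and the `double_tag_prob` callback are hypothetical; the callback stands in for the parametrized double-tag probability, whose functional form is not given in the text. The opening angle uses the standard definition $\Delta R = \sqrt{\Delta\eta^2 + \Delta\phi^2}$.

```python
import math
from itertools import combinations

def delta_r(jet1, jet2):
    """Opening angle in the eta-phi plane, with delta-phi wrapped into [-pi, pi]."""
    dphi = math.remainder(jet1["phi"] - jet2["phi"], 2.0 * math.pi)
    return math.hypot(jet1["eta"] - jet2["eta"], dphi)

def pair_variables(jet1, jet2):
    """Average pT, average |eta| and delta-R of a dijet pair."""
    mean_pt = 0.5 * (jet1["pt"] + jet2["pt"])
    mean_abs_eta = 0.5 * (abs(jet1["eta"]) + abs(jet2["eta"]))
    return mean_pt, mean_abs_eta, delta_r(jet1, jet2)

def event_weight(jets, double_tag_prob):
    """Eq. (4): sum of the double-tag probability over all loose-tagged jet pairs.

    `double_tag_prob(mean_pt, mean_abs_eta, dr)` is a hypothetical callback.
    """
    loose = [j for j in jets if j["loose_tag"]]
    return sum(double_tag_prob(*pair_variables(a, b))
               for a, b in combinations(loose, 2))
```

An event with fewer than two loose-tagged jets has no pairs and therefore receives a weight of zero.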

The background template is obtained from data by weight- 
ing it with the double-tag probability. The signal template 
is acquired directly from simulation. A third template rep- 
resenting signal events behaving as background is also de- 
rived from simulation, but weighted with the double-tag 
probability. This is done in order to correct for contamina- 
tion from signal events in the control region. Finally, the 
cross section is extracted from a binned maximum likelihood
fit of these three templates to the distribution
measured from data. Figure 3 shows the comparison between
the reconstructed top mass from data and the expected
signal and background distributions normalized to the yields
found in the fit. The measured cross section is
$\sigma_{t\bar{t}} = 157 \pm 30\,(\mathrm{stat.}) \pm 47\,(\mathrm{sys.}) \pm 8\,(\mathrm{lumi.})$ pb
with a signal fraction of 40%.

The value is in agreement with the Standard Model pre- 
dictions and with the value obtained by the main analysis. 
This provides an important and independent cross check of 
the measurement. 
Fig. 3. Reconstructed top quark mass $m_{top}$ for b-tagged jet
combinations observed in data (circles) passing the full event selection.
For comparison, the expected background (dashed line) and $t\bar{t}$
signal (solid line) are shown, both normalized to the yields from
the fit.



7 Conclusions 

A first measurement of the top quark pair production cross 
section in the fully hadronic decay channel at a center-of- 
mass energy of 7 TeV has been presented. The measure- 
ment results in a cross section of 

$$\sigma_{t\bar{t}} = 136 \pm 20\,(\mathrm{stat.}) \pm 40\,(\mathrm{sys.}) \pm 8\,(\mathrm{lumi.})\ \mathrm{pb} \qquad (5)$$

A cross check analysis, using a neural network based 
event selection and a different QCD multijet background 
estimate, gives a result compatible within the errors. Both 
results are consistent with earlier CMS measurements in 
the dilepton and lepton+jets decay channels and also with 
the Standard Model approximate NNLO theoretical
calculations [7,8,9].